Learning Compact and Effective Distance Metrics with Diversity Regularization
نویسنده
چکیده
Learning a proper distance metric is of vital importance for many distance based applications. Distance metric learning aims to learn a set of latent factors based on which the distances between data points can be effectively measured. The number of latent factors incurs a tradeoff: a small amount of factors are not powerful and expressive enough to measure distances while a large number of factors cause high computational overhead. In this paper, we aim to achieve two seemingly conflicting goals: keeping the number of latent factors to be small for the sake of computational efficiency, meanwhile making them as effective as a large set of factors. The approach we take is to impose a diversity regularizer over the latent factors to encourage them to be uncorrelated, such that each factor can capture some unique information that is hard to be captured by other factors. In this way, a small amount of latent factors can be sufficient to capture a large proportion of information, which retains computational efficiency while preserving the effectiveness in measuring distances. Experiments on retrieval, clustering and classification demonstrate that a small amount of factors learned with diversity regularization can achieve comparable or even better performance compared with a large factor set learned without regularization.
منابع مشابه
Robust Distance Metric Learning with Auxiliary Knowledge
Most of the existing metric learning methods are accomplished by exploiting pairwise constraints over the labeled data and frequently suffer from the insufficiency of training examples. To learn a robust distance metric from few labeled examples, prior knowledge from unlabeled examples as well as the metrics previously derived from auxiliary data sets can be useful. In this paper, we propose to...
متن کاملCollaborative Web Search
Most of the existing metric learning methods are accomplished by exploiting pairwise constraints over the labeled data and frequently suffer from the insufficiency of training examples. To learn a robust distance metric from few labeled examples, prior knowledge from unlabeled examples as well as the metrics previously derived from auxiliary data sets can be useful. In this paper, we propose to...
متن کاملWarped product and quasi-Einstein metrics
Warped products provide a rich class of physically significant geometric objects. Warped product construction is an important method to produce a new metric with a base manifold and a fibre. We construct compact base manifolds with a positive scalar curvature which do not admit any non-trivial quasi-Einstein warped product, and non compact complete base manifolds which do not admit any non-triv...
متن کاملImage Classification from Small Sample, with Distance Learning and Feature Selection
Small sample is an acute problem in many application domains, which may be partially addressed by feature selection or dimensionality reduction. For the purpose of distance learning, we describe a method for feature selection using equivalence constraints between pairs of datapoints. The method is based on L1 regularization and optimization. Feature selection is then incorporated into an existi...
متن کاملEffective Distance Teaching and learning in Higher Education
Nowadays, Universities have come across a main transformation. Lack of budget, an increase in the number of university students, a change in the student population, up-to-date and various educational needs of each society require fundamental changes that are coordinated with recent needs. This study aimed to evaluate the features of effective distance education in higher education. Findings of ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015